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Oh. Abstract 

We comment on a formulation of quantum statistical mechanics, which incor- 
porates the statistical inference of Shannon. Our basic idea is to distinguish the 
dynamical entropy of von Neumann, H = — kTrplnp, in terms of the density ma- 
trix p(t), and the statistical amount of uncertainty of Shannon, S = —k J2nPn hip n > 
with p n = (n\p\n) in the representation where the total energy and particle num- 



bers are diagonal. These quantities satisfy the inequality S > H. We propose to 
interprete Shannon's statistical inference as specifying the initial conditions of the 
system in terms of p n . A definition of macroscopic observables which are charac- 
terized by intrinsic time scales is given, and a quantum mechanical condition on 



the system, which ensures equilibrium, is discussed on the basis of time averaging. 
An interesting analogy of the change of entroy with the running coupling in renor- 
malization group is noted. A salient feature of our approach is that the distinction 
between statistical aspects and dynamical aspects of quantum statistical mechanics 
q ' is very transparent. 

> ■ 

1 Introduction 

A formulation of statistical mechanics on the basis of Shannon's information theory ' has 
been proposed by Jaynes in 1957. 2 ) This formulation utilizes the least biased statistical 
inference about a physical system on the basis of a limited amount of information available. 

In the present note we discuss quantum statistical mechanics, 3-9 -' which incorporates 
Jaynes's proposal, and we examine what kind of picture appears if one distinguishes 
the statistical aspects and dynamical aspects of quantum statistical mechanics to the 
maximum extent. We work exclusively on quantum statistcal mechanics, though in the 
course of our discussion we comment on the recent progress in the Boltzmann approach 
to classical statistical mechanics . 10_12 ) 

We would like to briefly summarize the basic aspects of our analysis. We examine a 
quantum mechanical mixed state, which is slightly away from thermal equilibrium, in the 
representation where the total energy and particle numbers are diagonal. The average 
value of any macroscopic observable O is given by (0(t)) = TrOp(t). For the system 
slightly away from equilibrium, (0(t)) is time dependent in general, and thus the density 
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matrix p(t) is time dependent. Following von Neumann, we introduce the (dynamical) 
entropy defined by 

H = -kTrp\n P (1.1) 

which is a generalization of Gibbs entropy to a quantum system. It is well known that H 
thus defined is constant in time. 

The basic observation in this note is that we can define another quantity 

S = -kJ2Pn\np n (1.2) 

n 

in terms of p n = (n\p\n) in the representation where the total energy and particle numbers 
are diagonal. By definiton we have p n (t) = p„(0) and thus S is also constant in time. We 
propose to identify S thus defined, which can be regarded as basically statistical quantity, 
as the amount of uncertainty of Shannon which was introduced into statistical mechanics 
by Jaynes. 2 - 1 Note that H in (1.1) agrees with S in (1.2) only when p is diagonalizable 
simultaneously with the total Hamiltonian and the total particle number operator. This 
clear distinction between H in (1.1) and S in (1.2), to our knowledge, has not been made 
in the past. 13 ) 

In the present formulation of quantum statistical mechanics, we regard the least biased 
estimate on the basis of limited amount of information discussed by Jaynes as a least 
biased estimate of initial conditions on the diagonal elements p n . Note that the diagonal 
elements of p(t) are constant in time and thus they cannot approach equilibrium values 
by any dynamical motion but by statistical inference. (Our initial conditions are thus 
final conditions also.) The initial state is thus specified by {p n } and a set of macroscopic 
observables other than the total energy and particle number. From the days of Boltzmann, 
it is well known that the second law of thermodynamics is mechanically represented only 
by means of assumptions regarding initial conditions, 12 ) or by choosing the typical states 
in the classical Boltzmann analysis. u ) In quantum statistical mechanics, as formulated 
here, it is clear that Shannon's statistical inference does not precede physical dynamics. 

As for the main problem of quantum statistical mechanics as to what kind of dynamical 
properties of the system ensure the approach to eventual thermal equilibrium, we propose 
to analyze a set of macroscopic observables, each of which is characterized by an intrinsic 
time scale r. As for the entropy law, some form of coarse graining is necessary not only in 
the Gibbs approach to quantum statistical approach 8 ^ but also in the Boltzmann approach 
to classical statistical mechanics. 10 ) In our approach it turns out to be more convenient to 
take a coarse graining in the "time direction" or a suitable time averaging. 14 ' 9 ) The entropy 
law of Clausius in the present formulation is expressed as the approach of the macroscopic 
observables of the system on a suitable time average to those of the almost equilibrium 
state, whose physical entropy is estimated by the maximum value of Shannon's S. 

2 Shannon's Least Biased Inference 

We consider the variable x which takes the n values {xi,x 2 , ....,x n } and define the prob- 
ability pi for the variable x to assume the value Xj. The no n- negative probability pi is 
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constrained by the condition 

i>=i, (2- 1 ) 

1=1 

which means that the total probability is unity. We also consider a smooth function f(x) 
of the variable x, such as f(x) = x. 

We then ask what we can say about the set of probabilities {pi,P2, , ■■■■,Pn}, if only 
available information is the average value < / > of f(x) defined by 

<f>=f2Pif(xi). (2.2) 
i=i 

Clearly it is impossible to determine all pi uniquely for a large value of n since we know 
only < f >. 

Shannon introduced the notion of amount of uncertainty S(pi,p 2 , ■■■-,p n ) for the set 
of variables {pi,P2, ■■■■,Pn}, and he proposed to determine each pi by allowing the max- 
imum amount of uncertainty, or equivalently, the least bias for the chosen solution of 
{pi,P2, ....,p n }- On the basis of a composition law, Shannon derived the amount of 
uncertainty 1 ^ ( see also Appendix in ref. 2) 

n 

S(p 1 ,p 2 ,....,p n ) = -kJ2Pi^Pi (2-3) 

i=i 

with a positive constant k. 

We now apply the above theory of inference to statistical mechanics. Consider a closed 
system for which one knows that the total energy is confined within a small range 

E - ^AE <E n <E + ^AE (2.4) 

with the constraint 

Y / E nPn = E (2.5) 

n 

for sufficiently small AE (with a fixed particle number N and a fixed volume). The 
maximum of the Shannon's amount of uncertainty then gives rise to the probability for 

E„ 



p n = exp[-PE n ]/Z, 

Z= ]T exp[-pE n ]. (2.6) 

E~±AE<E n <E+±AE 

The parameter f3 = (3(E), which is introduced as a multiplier, is defined by 

This formulation, which exhibits the temperature explicitly, is more convenient than the 
conventional formulation in microcanonical ensemble. 15 - 1 This formulation, if one assumes 
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static equilibrium, is reduced to the microcanonical ensemble in the limit of small AE: 
The equal a priori probabilities are obtained as 



p ex P[-^ = \ ( 28 ) 

Pn ~ AW(E, N) exp[-(3E] AW(E, N) 1 ; 

and the thermodynamic relation 

F = -ilnZ 

~ ~]n{AW(E,N)exp[-/3E\} 

= E--S/k = E-TS. (2.9) 

Here we used the Boltzmann entropy 

S = k\nAW(E,N) - -k^Pn^Pn (2.10) 

n 

where AW(E, N) stands for the number of quantum states within the above energy 
range. Our determination of the time independent quantities p n is thus consistent with 
the microcanonical ensemble for sufficiently small AE. 

In the next section, we discuss a formulation of quantum statistical mechanics which 
incorporates the above statistical inference, and yet a logically consistent formulation in 
the sense that statistical inference does not precede dynamical time development. Note 
that we rely on the statistical inference since we deal with an enormous number of states 
E n in (2.4). 



3 Statistical Inference and Quantum Statistical Me- 
chanics 

We discuss how to describe near- equilibrium states and their approach to thermal equilib- 
rium in a framework which incorporates the least biased statistical inference. We assume 
that we analyze a physical system which is completely characterized by its total energy 
and particle number, if the thermal equilibrium should be realized for a fixed volume. In 
the representation where the total energy and the particle number are diagonal, we have 
the density matrix 16 ) p(t) which satisfies 8 ) 

Trp(t) =J2p(t) nn = J2pn = l, 

n n 

TrpH = EnPn = (E), 

n 

Trpv = J2vnPn = (N). (3.1) 

n 
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We here assume for simplicity the presence of only one kind of particles. We first note that 
p n is time independent p n = (n\p(t)\n) = (n\e~ lHt / n p(0)e lHt / n \n) = (n\p(0)\n), since the 
total Hamiltonian is diagonal in the present representation. We assume that either none 
of the energy levels are degenerate, or if some of them are degenerate, the density matrix 
p(t) is diagonalized by a (constant) unitary transformation beforehand in each sector 
which contains the degenerate energy levels. Consequently, all the possible off-diagonal 
elements of the density matrix p(t) are time dependent. Our proposal is to define the 
Shannon's amount of uncertainty, which is based on the information available, by 

S = -kJ2Pn\np n . (3.2) 

n 

This S, which carries no characteristic properties of quantum theory, may be assigned a 
purely statistical meaning, and it is time independent. 
The dynamical entropy of von Neumann 

H = -kTrp\np (3.3) 

which is a quantum generalization of Gibbs entropy, is also time independent since the 
time development of p is a unitary transformation. In ref. 9 (and also in ref. 5), this 
entropy H, which in principle contains the effects of quantum coherence, is called the 
information entropy. In this paper we stick to the classical notion of information and 
thus to the amount of uncertainty defined in (3.2). The advantage of choosing (3.2) as 
the amount of uncertainty becomes clear later. The average value of any operator in the 
Schrodinger representation is defined by 

(6(t))=Trp(t)O(0). (3.4) 

In the framework of Shannon's least biased statistical inference used by Jaynes, the 
maximum value of the amount of uncertainty, which is identified with S in (3.2) in the 
present formulation, is considered with the constraints (3.1). For a closed system we are 
considering, the constraints are actually replaced by the conditions (2.4) and (2.5) in 
Section 2. One then obtains the standard result (2.6) for the constant diagonal elements 
of p(t) 

p n = ex V [-P(E)E n ]/Z, 

Z= ]T exp[-P(E)E n ] (3.5) 

E-\AE<E n <E+±AE 

but the time dependent off-diagonal elements of pit) are left completely unspecified, 
namely, we remain maximally noncommittal with regard to missing information. 17 ) 

On the other hand, the maximum of the von Neumann entropy H with conditions 
(2.4) and (2.5) would give rise to 6,7 ' 8 ) 

(n\p(t)\n) = e W [-P(E)E n ]/Z, 

(n\p(t)\m) = 0, n ^ m. (3.6) 
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This density matrix p is completely diagonal; in other words, if one imposes the maximum 
condition on the von Neumann entropy, we arrive at the conventional microcanonical 
ensemble without any freedom of time development. In fact, the conventional analysis 
of statistical mechanics utilizes this property of the entropy H and attempts to prove 
the Boltzmann's H-theorem for (a coarse grained form of) H in (3.3) as an indicator of 
the general tendency toward thermal equilibrium. 8 ' 9,18 ) In contrast, we here attempt to 
characterize the approach to thermal equilibrium from a different perspective. Also, we 
will later suggest that the dynamical density matrix does not approach the static form 
(3.6) even in thermal equilibrium, since the oscillations with microscopic time scales, 
which should be described by the dynamical density matrix, always exist in the system. 

For the general situation, the maximum uncertainty inference of Shannon as formu- 
lated here does not specify the density matrix completely, and the best estimate of the 
average of a general macroscopic operator O is 



where p n is defined in (3.5) and the sum is taken over the available eigenstates in the rep- 
resentation where the total Hamiltonian and particle number are diagonal. This quantity 
is time independent by definition and agrees with the conventional average in thermal 
equilibrium. We will later show that, only after a suitable time averaging, the true av- 
erage (3.4) is well approximated by the conventional thermal average (3.7) if the system 
satisfies certain dynamical properties. It is clear that the least biased inference of Shan- 
non, as formulated here, is purely statistical and does not provide any information about 
dynamical time development. 

We next note an inequality between S and H (see also refs. 5 and 19) 



This relation is shown by using the standard technique of the statistical mechanics 20 ^ . 
This inequality, H < S, valid for any pit) suggests that we can impose the maximum 
amount of uncertainty condition on S without any dynamical constraint on H. (In con- 
trast, if one should use (3.3) as Shannon's amount of uncertainty, the statistical inference 
would precede physical dynamics, since the statistical inference would then determine the 
time dependent off-diagonal elements of p(t) as well.) Besides, H < S shows that we start 
with an initial state with smaller entropy. 12 ^ 

In analogy with the definition of a quantum state in terms of a complete set of commut- 
ing hermitian operators, 22 - 1 we assume that our density matrix p(t), which is a generaliza- 
tion of the Schrodinger wave function, is well specified by a set of macroscopic observables 
{0} , and total energy and particle number; in the present case, the operators {0} do 
not commute with the total Hamiltonian by our assumption. As in classical Boltzmann 
statistical mechanics, n ) where one works exclusively on macroscopic variables, one may 
define macroscopic observables O r in the present framework by the condition 



(6)o = 5>„6(0) nn = J2O(0)nnexp[-P(E)E n ]/Z 



(3.7) 



n n 



H < S. 





(3.9) 



6 



This condition is written in full detail as 

(0 T {t)) = Y,P™n(0)(n\d T \m)expli(E n -E m )t/h] 

m,n 

1 f t+T 

= - dtJ2Pmn(0)(n\O T \m)exp[i(E n -E m )t/h] (3.10) 

T m,n 

and thus the macroscopic observables are not sensitive to the microscopic time (shorter 
than r) dependence of the density matrix. 23 ) Each macroscopic observable in our definition 
is labeled by a characteristic time scale r, which may in general depend on the temperature 
contained in the diagonal components {p n } of the density matrix. 24 ) An operator with 
large r gives a macroscopic observable which agrees with our intuitive understanding: For 
example, r = oo for the total energy or particle number. 

We then define the non-equilibrium state operationally by the relation 

(AO T (t)) = Trp(t)[Or-T,Pn<n\O r \n>] 

n 

= Trp(t)d T -J2Pn<n\6 T \n>^0 (3.11) 

n 

for some macroscopic observables T other than the total energy and particle number. 
Here p n is defined in (3.5). If we do not find any sensible macroscopic observable T which 
satisfies the above relation ( after a suitable time averaging described later ), the system 
is in thermal equilibrium. 

Our system described by the density matrix p(t) then develops with time following 
Schrodinger equation with a fixed value of the von Neumann entropy H . 



4 Second Law in Quantum Statistical Mechanics 

Our next task is to specify what kind of dynamical properties of a many particle system 
ensure that the system with initial conditions defined by our statistical inference will in 
the long run approach the almost equilibrium state. We first note that the time average of 
our p{t) over a sufficiently long period approaches arbitrarily close to the equilibrium p 
with diagonal elements p n in (3.5). In this sense, our system by its construction satisfies 
the Boltzmann's ergodic postulate; the time average behavior of a system is the same as 
its equilibrium behavior. Namely a suitable time averaging of (3.11) gives rise to 

(Ad T ) ~ (4.1) 

which is the (necessary) condition for equilibrium. 

We need to sharpen the time averaged behavior (4.1) to be a sufficient dynamical 
condition for equilibrium. We first recall a quantum version of Poincare's recurrence 
theorem. 25 ^ The theorem states that observables in a (finite) many particle system with 
discrete energy spectrum are almost periodic, namely, after a suitable time lapse the 
system comes back to arbitrarily close to the original configuration. This means that the 
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system has a rather well defined dynamical property and we here deal with those finite 
systems. 26 ) We however assume that the recurrence time for a many particle system is 
sufficiently long by the time scale of our laboratory. The reccurence theorem provides a 
partial justification for the independent specification of diagonal elements of p in (3.5) 
and the non-diagonal elements in (3.11), since an apparently thermal equilibrium state, 
which is related to (3.5), can come back to any original starting configuration. 

To ensure the thermal equilibrium, we need to avoid the persistent synchronized col- 
lective oscillation even if the averaged behavior (4.1) is satisfied. We expect that the 
probability for a great number of oscillators in p(t) to synchronize persistently is negligi- 
bly small for a system of a many particle system. In the generic situation, our system is 
expected to give a negligible time correlation between (AO T (t)) for a large time difference 
\t\ — £2!) which is however very small compared to the recurrence time. 

The averaged behavior (4.1) is now sharpened to be a stronger dynamical postulate as 



see also (3.11)) 

— 1 rt+At c 

\(AO T )(t,At c )\ = \ — J dt(AO T (t))\ 



I rt+At c 

— / dtTrp(t)0 T — y^p n < n\0 T \n > 

16^ J t 



'At c j t 

« KO T )o| (4.2) 

for any macroscopic observable T and a fixed finite At c ; (O r )o — UnPn < n\0 T \n > \ 
is defined in (3.7) with p n in (3.5). For At c = 00, the left-hand side of this relation 
vanishes by our construction. We impose this condition for a finite At c and assume 

that (AO T )(t, At c ) is not sensitive to the absolute value of t and a small variation of 
At c . Physically, this condition means the existence of a well-defined relaxation time for 
a set of macroscopic observables. The actual magnitude of At c , which is expected to 
be microscopically quite long and of the order of macroscopic time scale, will generally 
depend on the specific system we are analyzing. In terms of the parameter r in (3.9), we 
expect (for an operator with a finite r) 

t < At c < 00. (4.3) 

To observe the relaxation in terms of the macroscopic observables T) we need to have 
r < At c . 

Note that the condition (4.2) does not contradict the reccurence theorem: 25 ) In prac- 
tice, after the initial relaxation, it might be that one can take At c ~ r, namely, one 
cannot recognize the sizable deviation from thermal equilibrium by the time resolution of 
macroscopic observables. However, if the reccurence occurs, one need to wait for the time 
~ At c for the system to relax again. 

Since 

(Or(t)) = J2Pmn(0){n\d T \m)e^[i(En-E m )t/h] (4.4) 

m,n 

we have only the near diagonal components after the above time averaging (4.2); namely, 
only the terms with 

\E n - E m \<2nh/At c (4.5) 
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survive the time averaging. Note that 2nh/At c is very small compared to AE in (3.5) for 
a many particle system we are interested in. 27 ) Also the real time-independent diagonal 
components are subtracted in (AO T (t)). In the remaining terms we have a sum of complex 
amplitudes p mn (0){n\O T \m) with nearly equal frequencies (E n — E m )/h: Our condition 
(4.2) is that the sum of oscillating quantities are either absent or destructively interfere 

I E p mn (0)(n\d T \m)ex P {i(E n -E m )t/h]\ < |(O r )o| (4.6) 

o<| En E m 

for a general value of t. 

As a concrete example of our analysis, we illustrate the problem of a gas confined in the 
left-half of a box and then removing the partition, although this probelm corresponds to 
the case of far from equilibrium. 28 ** We make a statistical inference to fix p n on the basis 
of the information about all the possible energy spectra of the entire box, the average 
energy and particle number. The macroscopic observable T (x) may be chosen as the 
particle number density (in a suitably smeared sense) inside the entire box. We define 
(O T ) (x) by using the result of the above inference and (3.7). The observable 

Trp(t)AO T (x) = Trp(t)6 T (x) - (O T ) (x), (4.7) 

which has a positive peak in the left-half of the box and a negative peak in the right-half 
of the box at t — 0, is described by choosing the off-diagonal time dependent elements of 
p(t) suitably. Our inference agrees with the conventional answer of statistical mechanics, 
if the possible macroscopic oscillation in Trp(t)AO T (x) diminishes soon. 

To analyze the effects of time averaging, one may consider a simpler example of the 
average position of particles (instead of the particle number density) 

= X = J2^/N (4.8) 

i 

of the gas confined into the left-half of the box at t — 0, which is one of the indicators 
of the macroscopic motion of particles. One may make a crude estimate of the typical 
frequency contained in Xi by considering the matrix element (n + l\xi\n) for a free particle 

as 

u ~ (hn(n + l)) 2 / \2mL 2 Ti) - (kirn) 2 / (2mL 2 h) ~ (him)/(mL 2 ) ~ v/L (4.9) 

where v = (hnn)/mL is the typical velocity, which is determined by the temperature 
appearing in the diagonal elements p n . Here L is the size of the box. It is then unlikely 
that (0(t)) contains the sizable components with frequency 

lo = \E n - E m \/h < 2n/At c (4.10) 

to survive the time averaging for At c ^> L/v (or equivalently, 2n/At c 2nv/L), except 
for the static diagonal elements. For this choice of O and the crude estimate, the gas 
which was in thermal equilibrium and confined into the left-half of the box at t — 
satisfies the condition (4.2) for At c 3> L/v. The relaxation time in the present example 
is determined by the typical transport time scale r ~ L/v, provided that a soliton-like 
persistent collective motion does not occur. 
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4.1 Entropy law of Clausius 



The physical entropy of the final thermodynamic state defined by this time averaging 
(4.2), which is characterized by At c , is estimated by 

H(At c ) = -kTrp In ~p (4.11) 

with 

rt+At c 

p=(l/At c )J p{t)dt (4.12) 

for a generic value of t. This value is expected to be close to the maximum of Shannon's 
statistical amount of uncertainty S, which is the maximum value of any sensible definition 
of entropy because of (3.8): Note that 

S>H(At c ) (4.13) 

since p nn = p(t) nn = p n and Trp = 1, which are sufficient to prove (3.8). We interprete this 
approach of H(At c ) to S as a manifestation of the entropy law in the present formulation of 
quantum statistical mechanics. 29 ) We reiterate that we used two ingredients to formulate 
the entropy law in our approach : The first is the statistical input related to the least 
biased inference on the basis of a limited amount of information available (3.5), and the 
second is the dynamical input related to the time averaging in (3.9) and (4.2). 

The von Neumann entropy H is in contrast rigidly defined by the basic dynamics, 
and it does not allow any arbitrary manipulation such as taking a time averaging of p{t) . 
The dynamical entropy H of von Neumann stays constant throughout the unitary time 
development of the system regardless of our time averaging procedure. In fact, the above 
time averaging (4.2) to define the physical thermodynamic state resolves the discrepancy 
of the dynamical von Neumann H and the physical statistical entropy (4.11) we defined. 
Physically, the von Neumann entropy H is sensitive to the dynamical motion of all the 
time scales in the system, whereas the thermodynamic entropy (4.11) is not sensitive to 
the motion with time scales shorter than ~ At c ; moreover, the condition (4.2) states that 
the macroscopic motion with time scales larger than ~ At c in the system is negligible. 

We here note an interesting analogy of the present formulation of physical entropy 
with the renormalization group in field theory. The parameter At c (or to be precise, 
h/At c ) characterizes the energy scale of the theory, and the entropy H(At c ) = —kTrphi p 
in (4.11) corresponds to the renormalized running coupling constant. The ultra-violet 
limit At c — > gives rise to the von Neumann entropy H, which corresponds to the bare 
coupling constant, the fundamental quantity defined by the basic dynamics , namely, 
quantum mechanics in the present case. But physics is not sensitive to the bare coupling 
constant. The infrared limit of H for At c — > large gives rise to the measurable quantity, 
the maximum of Shannon's S, corresponding to the coupling constant a = 1/137 in QED 
defined in the Thomson limit. In this analogy, the entropy law of Clausius corresponds 
to a statement of the existence of a stable infrared fixed point for finite At c . 

This picture also suggests that the dynamical density matrix p(t) does not approach 
the static diagonal form in (3.6) even in thermal equilibrium, since the oscillations with 
microscopic time scales always exist. The microscopic time dependence of the equilibrium 
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density matrix is also expected in the conventional Gibbs ensemble theory, if one defines 
the thermal equilibrium by macroscopic observables T with well defined time scale r. 
Our definition of the macroscopic observables T in (3.10) shows that the macroscopic 
observables are not sensitive to the possible microscopic time (shorter than r) dependence 
of the equilibrium density matrix. 

Our picture is expected to have an implication on the linear response theory for time 
dependent current correlations 30 -* with frequencies which are comparable to ~ 1/r, since 
the possible time dependence of the equilibrium density matrix is not ignored for such a 
case. In the applications of the fluctuation-dissipation theorem, one usually examines the 
correlation functions of operators averaged with the static diagonal equilibrium density 
matrix e~^ n /Z. The possible microscopic time dependence of equilibrium density matrix 
may lead to interesting implications on the analysis of the foundations of linear response 
theory. 

5 Discussion 

The purpose of this note has been to discuss quantum statistical mechanics which incor- 
porates Shannon's statistical inference, and to analyze a resulting picture for the second 
law. Our statistical inference does not resolve the basic issue why the statistical inference 
works for a many particle system, but our statistical inference , unlike the equal a priori 
probabilities, allows an analysis of the dynamical aspects of the second law. 

Our picture for the general tendency toward thermal equilibrium is quite different from 
the one in the conventional formulation of quantum mechanical H-theorem: 8 ' 9,18 ) We note 
that the conventional coarse-grained approach performs not only the statistical operation 
by setting the diagonal elements in each subsector of the density matrix to be equal by 
assuming equal a priori probabilities, but also the dynamical operation by setting the 
off-diagonal time dependent elements to be zero to let the entropy increase. 

We now briefly comment on the two ambitious approaches to the second law on the 
basis of purely dynamical considerations, to be compared with our more conservative 
statistical analysis. The first is the approach of Tasaki 31 ) and the second is the approach 
of Van Hove: 32 ) 

Tasaki analyzed a possible quantum mechanical derivation of a canonical ensemble 
starting from a pure quantum state. The basic idea is to consider two quantum systems, 
a bath \a) and a subsystem \a), and one then introduces a suitable interaction between 
them so that one obtains a pure quantum state for the combined system, 

a, a 

which has a vanishing von Neumann entropy. One then traces out the bath system in the 
density matrix 

Ptot = I^XVI - Psub = Y,(M^M) ab \b)(a\. (5.2) 

a,b 
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Because of the quantum entanglement of two systems, one then obtains a mixed state for 
the subsystem which has a non-vanishing entanglement entropy 33 ) 

H sub = -kTrp sub In p suh > 0. (5.3) 

To reproduce a canonical ensemble for the subsystem, Tasaki assumes the "hypothesis of 
equal weights for eigenstates" for the combined system. He also argues the cancellation of 
oscillating non-diagonal components of the density matrix for the subsystem at sufficiently 
large t by using Chebyshev's inequality. 31 ) If one can show that the "hypothesis of equal 
weights for eigenstates" holds for a rather general class of dynamical systems, the quantum 
entanglement entropy would provide a physical explanation of the statistical entropy. At 
this moment, it appears that the generality of the hypothesis has not been established. 

In the analysis of entanglement entropy, a clear distinction between the von Neumann's 
H, which is equal for both of the bath and the subsystem 33 ) H bath = H sub and thus not 
extensive, and the Shannon's S, which could be vastly different for the bath and the 
subsystem and thus could be extensive, is expected to be essential. 

Van Hove 32 ) analyzed the possible approach of a general mixed state to micro canonical 
states by a dynamical time development in the limit t — > oo. To be specific, he starts 
with a mixed state 

p(0) = / da\a)\c a \ 2 (a\ (5.4) 



where Hq\q) — e a \a) , and the unitary time development generated by 

U(t) = exp[-i(H + XV)t/h]. (5.5) 

His major claim is ( see Eq. (1.4) in the first paper in ref. 32) 

TrdU(t)p(0)U'\t) -> (0) mcrocanonical (5.6) 

for t — > oo. Since the operator U(t) is unitary for whatever large but finite t, both of 
the von Neumann's H and Shannon's S remain at the initial value different from the 
microcanonical value. Van Hove however considers a singular limit X 2 t — > constant for 
t — > oo (i.e., Xt ~ 1/A — > oo for t — > oo). In such a limit, if literally taken, the ^-matrix 

(and consequently perturbation theory he uses) is not defined, since the condition of an 
adiabatic switch-on and switch-off of the interaction is not satisfied. See also ref. 34. If 
one can provide a mathematical basis for the singular limit, one would be able to derive 
the microcanonical ensemble from a general mixed state by a unitary time development. 
At this moment, to our knowledge, such a mathematical basis appears to be missing. 

Finally, we mention recent activities on an alternative approach to the second law. 
Jarzynski 35 ) found the following amusing identity 

= — f dp{z')e-P Ho{z) -P {Hl{z,) - Ho{z)) 

Zq J 

= — [ dutfe-WoW-Wi*) = ( e -W*)) (5.8) 

Zq J 
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where we defined the mapping 

* = (q(to),p(to)) - = (q'(ti),p'(ti)) (5.9) 

as the canonical transformation generated by the time dependent Hamiltonian (a quantum 
version is also known 36 )) 

H X (t), t < t < (5.10) 

with 

A(t o ) = 0, A(*i) = l. (5.11) 

The Liouville theorem dfi(z') = dfi(z) is essential in the above identity. We also defined 
the work done during the time development by W(z) = Hi(z') — H (z). 

If one defines the Helmholtz free energy for the system described by Hamiltonian Hi 
at temperature f3 by Fi(f3) , one obtains from (5.8) 

AF(P) = Fi{(5) - F (P) = -Iln(e-^W). (5.12) 

and, by noting the mathematical inequality (exp[A(z)]) > exp[(j4(;z))], 

AF((3) < (W) (5.13) 

which resembles the basic thermodynamic inequality, 6 ^ an alternative expression of the 
second law. The identity (5.8) as it stands is equivalent to the Liouville theorem and 
thus contains no information about the thermal entropy generation. In fact, it is known 
(from an explicit analysis of harmonic oscillators, for example) that the equality sign in 
(5.13) does not hold for an infinitely slow adiabatic work 35 -*, which is by itself consistent: 
But to analyze the equality sign in (5.13), one need to analyze the approach of a system 
in (5.8), once driven out of equilibrium by an external work, to thermal equilibrium 
again. The analysis of Jarzynski is thus complementary to the analysis in the present 
note, and certainly it does not replace our analysis. See recent works 37 ' 38 related to the 
above identity. See also Lenard 39 -* for an analysis of similar inequality associated with a 
canonical ensemble. 40 -* 

In conclusion, we have illustrated a physical picture of the second law when one makes 
a clear distinction between statistical aspects and dynamical aspects in statistical me- 
chanics, which is made possible if one uses S in (3.2). Our basic view as presented here is 
rather conservative, namely, it is based on the premise that the entropy law of Clausius is 
not a direct consequence of microscopic dynamical laws alone. In the context of classical 
Boltzmann approach, this view appears to be shared with experts just to quote " It fol- 
lows that the macroscopic dynamics cannot be a consequence of the microscopic dynamics 
alone". 10 '' The remaining basic issue in the present approach is to specify precisely the 
class of many-particle Hamiltonians which ensure (4.2). 

I thank H. Tasaki for numerous clarifying comments, and A. Shimizu and M. Ueda 
for helpful comments at the initial stage of this work. 
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